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The overwhelming success of online social networks, the key actors in the Web 2.0 cosmos, has 
reshaped human interactions globally. To help understand the fundamental mechanisms which 
determine the fate of online social networks at the system level, we describe the digital world as 
a complex ecosystem of interacting networks. In this paper, we study the impact of heterogeneity 
in network fitnesses on the competition between an international network, such as Facebook, and 
local services. The higher fitness of international networks is induced by their ability to attract 
users from all over the world, which can then establish social interactions without the limitations of 
local networks. In other words, inter-country social ties lead to increased fitness of the international 
network. To study the competition between an international network and local ones, we construct a 
1:1000 scale model of the digital world, consisting of the 80 countries with the most Internet users. 
Under certain conditions, this leads to the extinction of local networks; whereas under different 
conditions, local networks can persist and even dominate completely. In particular, our model 
suggests that, with the parameters that best reproduce the empirical overtake of Facebook, this 
overtake could have not taken place with a significant probability. 

Keywords: complex systems — complex networks — online social networks — digital ecology — digital 
world — network of networks — double mean field approximation 


I. INTRODUCTION 

The rapid growth of online social networks (OSNs), 
such as Twitter or Facebook, led to over two billion 
accounts being active in 2014 [T], connecting over one 
quarter of the world’s population and 72% of online U.S. 
adults [2]. Bridging the gap between social sciences, and 
information and communication technologies, OSNs con¬ 
stitute a crucial building block in the development of in¬ 
novative approaches to the challenges society faces today. 
However, technological progress over the last decade has 
dramatically outpaced our understanding of the new sys¬ 
tems and their impact on society. 

This lack of understanding of the complex dynamics 
of the digital world reveals the pressing need for a com¬ 
prehensive and concise theory to describe and model the 
online world as a set of interacting networks. In this con¬ 
text, the activity of users has become a resource to be 
fought over and which drives competition in the digital 
world. Digital services only persist if they can attract 
and maintain users’ attention. Here, we describe the on¬ 
line world as a complex, digital ecosystem in which in¬ 
teracting networks constitute species in competition for 
survival. Within this ecological context, the extinction of 
a network corresponds to the complete absence of activ¬ 
ity, as an entirely passive network cannot function and, 
more importantly, will not attract new users. In a recent 
study [3], we demonstrated that a moderate number of 
identical networks can coexist in the digital ecosystem, 
in contrast to the principle of competitive exclusion 

In contrast to that previous work, in this paper we ad¬ 
dress the heterogeneity of networks. Networks can differ 
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in functionality, features, and -most importantly- they 
can address different peer groups. Here, we show how 
the effect of different overlapping peer groups can be de¬ 
scribed in terms of different degrees of network fitness. 
We find that under certain conditions, the heterogeneity 
of degrees of htness can impede coexistence which would 
indeed be possible for identical networks. This effect is 
particularly important for the competition between local 
networks and an international network. Unlike users of 
local networks, users of the international network have 
the possibility to interact with people in other countries, 
providing this network an advantage over local ones, sim¬ 
ilar to a higher fitness of a certain species. A proper mod¬ 
eling of this effect requires taking into account the net¬ 
work of interactions among countries in the world, which 
results in a highly complex and non-linear dynamical sys¬ 
tem made of interconnected multilayer networked entities 
- we are hence dealing with networks of multiplex net¬ 
works [SHE], in contrast to Ref. [5]. Besides, inter-country 
interactions induce a different type of bifurcation as the 
symmetry of the previous model is broken, which consti¬ 
tutes a fundamentally new behavior not observed in [3]. 

Empirical observations have shown that Facebook ex¬ 
panded massively in the middle of the first decade of this 
century, starting in the US, when local networks were 
the most popular services in most countries. Only a few 
years later, Facebook had become the most popular net¬ 
work in most countries. So, is the fate of the digital 
world to become dominated by a single “big brother” as 
it takes over all our digital interactions? Alternatively, is 
digital diversity possible from a system-level perspective? 
In this paper, we show that due to the nonlinear char¬ 
acter of our model, the answer to both questions can be 
positive or negative depending on a range of parameters 
and, quite surprisingly, depending on chance. As we will 
show, our model, despite its simplicity and the limited 
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number of parameters, is able to describe surprisingly 
well the complex behavior of globally interacting online 
social networks. 


II. RESULTS 

A. Complex organization of the digital world 

The digital world consists of highly connected and 
strongly coupled interacting subsystems. These basic 
building blocks are single networks, each of which obeys 
specific dynamics in the absence of coupling to the whole 
system. So the complexity of the digital world is a con¬ 
sequence of both the dynamics of networks in isolated 
environments and the interactions between many such 
networks. Finally, not all of these building blocks are 
identical. Instead, different networks address different 
peer groups or have different functionalities. Hence, to 
reveal the fundamental mechanisms that determine the 
fate of the digital world, it is necessary to understand 
the interaction of heterogeneous networks, each driven 
by intrinsic dynamics. 

B. Isolated dynamics of online social networks 

The key actors in the digital world are OSNs; loosely 
defined as web-based platforms that enable digital so¬ 
cial interactions over the Internet. However, societies 
were organized as networks long before OSNs were even 
thought of. From this point of view, the growth of OSNs 
can be described through the dynamical processes by 
which people in the traditional offline social structure 
come to engage in OSNs. The topology of the OSN is 
now the digital counterpart of the underlying offline so¬ 
cial network [siiini. 

In isolation, this process of formation can be described 
by a set of simple dynamical mechanisms [5]. The system 
is initially given by an empty OSN and the underlying 
social structure. Individuals can be in three different 
states: active, passive or susceptible. While active and 
passive nodes exist in the online as well as the offline net¬ 
works, susceptible nodes are only present in the latter. 
A susceptible node can join the OSN via two different 
mechanisms: a viral activation effect which means that 
a susceptible node becomes active due to the influence of 
an active neighbor in the offline network and a mass me¬ 
dia effect, which represents the spontaneous activation 
of a susceptible node. In addition, an active node can 
become passive spontaneously {deactivation) and a pas¬ 
sive node can become active again due to the influence 
of an active neighbor {viral reactivation). See Fig. 
for a visualization of these mechanisms. Notice that, in 
the long time limit, when the number of susceptible in¬ 
dividuals is basically zero, this dynamics is equivalent to 
the susceptible-infected-susceptible model widely used in 
epidemiology m- As found in mm, this implies that 


online social networks can either exhibit a sustained ac¬ 
tivity (similar to the endemic phase), or they can become 
entirely passive (similar to the healthy phase). 

The evolution of OSNs rarely takes place in isolation. 
Nevertheless, we found a perfect case study in the Slo¬ 
vakian social network “Pokec”, which, due to the particu¬ 
larities of the country, has been growing in quasi-isolation 
for more than ten years. By analyzing the evolution of 
the topology of the social contact graph of “Pokec”, we 
were able to rigorously validate our model. Quite re¬ 
markably, with only two parameters, the model repro¬ 
duced the entire topological evolution with astonishing 
precision. 

Viral activation and viral reactivation occur at the 
same rate. A, and the ratio between this rate and the 
rate of mass media influence, n, governs the topological 
evolution. In particular, we observed that the real sys¬ 
tem underwent a dynamical percolation transition; that 
is, a phase transition between a disconnected phase and 
a phase in which a macroscopic fraction of the system is 
connected. The position of this transition is controlled 
by A//i, due to the complementary roles that the viral 
activation and mass media effects play in the topologi¬ 
cal evolution of the network (the former tends to connect 
components; whereas the latter tends to create new com¬ 
ponents). Finally, without loss of generality, we set the 
deactivation rate i5 to 1 which is equivalent of fixing the 
timescale of the model. 

To sum up, in our previous work j^, we were able 
to rigorously validate the dynamics ruling OSNs in isola¬ 
tion; the fundamental building blocks of the digital world. 
These findings constitute the foundation for the devel¬ 
opment of a more comprehensive theory of interacting 
heterogeneous networks. 


C. Competitive interaction between multiple 
networks 

The simultaneous existence of multiple digital services 
in competition for the attention of users suggests an eco¬ 
logical perspective from which to explain the prevalence 
of a given network or the coexistence of multiple net¬ 
works. In ecology theory, the principle of competitive 
exclusion [3] states that multiple species in competition 
for the same resource cannot coexist, as even the slight¬ 
est advantage of one species over the other is successively 
amplified; a mechanism referred to as “rich get richer” or 
preferential attachment nnn]. This eventually leads to 
the extinction of the inferior species. 

The key principle that drives the competition between 
OSNs is the fact that, due to the physical and cogni¬ 
tive limitations of users, the time they devote to online 
activities is limited. As a consequence, the viral param¬ 
eter, A, constitutes a conserved quantity that is never¬ 
theless distributed between the competing networks as 
Xi = 0 Ji{p^)\, where 0 Ji{p^) represents a normalized set 
of weights, that is 0 Ji{p^) = 1, and p^ = {p\, p|,...) is 
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Figure 1. Constituents of our model, a) Design of the international network and local networks, b) Sketch of our model using 
coarse-grained coupling, c) Visualization of the flight network. The size of the nodes is proportional to the number of users in 
the respective countries with Internet access. The transparency and thickness of the links represents the density of passengers 
between the countries concerned, d) Illustration of the competition between the international network and the local network 
within one country. 


a vector denoting the fraction of active nodes (activities) 
in the different networks. In general, users are more likely 
to subscribe to and engage in networks that are more ac¬ 
tive. Therefore, the viral activity of each network must 
be a function of the activity of the network itself. In par¬ 
ticular, we model this by assuming that the weighting 
oji{p^) is a function, such that duji{p‘^)/dpf >0. In [3] 
we proposed the particular form: 




[Pf] 


a 


E ni 



(1) 


were n/ denotes the number of networks. This choice al¬ 
lows us to interpolate between a set of independent net¬ 
works (cr <C 1) and highly coupled ones (cr 1). The 
activity affinity parameter, cr, then quantihes the ten¬ 
dency of users to subscribe to or engage in more active 
networks. Interestingly, in contrast to the principle of 
competitive exclusion, multiple networks can coexist be¬ 
cause the “rich get richer” mechanism is damped by the 
diminishing returns of the dynamics of network evolu¬ 
tion. For details, we refer the reader to Ref. [3]. In the 


following, we take into account the heterogeneity of net¬ 
works induced by different groups of individuals that can 
subscribe to the different networks. These aspects are 
not discussed in Ref. [3] and have important implications 
and applications, as we will show. 


D. Network heterogeneity leads to effective activity 

As mentioned above, since its official launch in 2004, 
Facebook has become the most popular OSN in most 
countries; even in countries where there was already a 
popular OSN before Facebook was launched. To mimic 
the real evolution of the digital ecosystem at the world¬ 
wide scale, we assume that one local network exists in 
each country in addition to a globally operating, interna¬ 
tional network (see Fig. [^). In the US, both networks 
are launched at the same time; whereas the international 
network is launched with a delay At in the remaining 
countries, to take into account the initial prevalence of 
local networks. 
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Once launched, the international network provides the 
user with the possibility to connect to individuals in dif¬ 
ferent countries, in contrast to local networks, making it 
more attractive to users. For a given country, the ad¬ 
vantage of the international network is directly related 
to the abundance of social ties between that country and 
the rest of the world. We use passenger air travel data 
as a proxy for the abundance of such ties. This choice is 
justified by the strong correlation between air travel flows 
and further measures of inter-country exchange, for in¬ 
stance email communication [22] or Twitter activity |23j . 

Users in country i experience the greater attractiveness 
of the international network as they perceive its activity 
with respect to the population of their own country and 
also with respect to their contacts in other countries. To 
account for this on a coarse grained level, in Eq. Q, we 
replace the activity of the international network by an 
effective activity as follows 

Pi, int ~ Pi.int + ^ ^ ' ^ijPj, \nt J (2) 

3 

where 


by a network of networks to a set of evolution equations 
of the average activity in the international network and 
in local networks. As we show in the following section, 
the results of the full model with heterogeneous topolo¬ 
gies exhibits similar behavior to that encountered by the 
double meanfield approximation. 

The first meanfield approximation consists of assum¬ 
ing a fully mixed homogeneous population in each coun¬ 
try. Let p^i denote the fraction of active users in net¬ 
work I G (loc, int) in country i and pf ; the fraction of 
nodes susceptible to joining this network. Then, the frac¬ 
tion of passive users is given hy 1 — ph — pf [. As ex¬ 
plained above, in each country the virality is distributed 
between the local and international network via the 
weight functions uji^dph^^, Phnt) and uji^tip^^^, = 
l-a;ioc(p^joc^PMnt). as introduced in Eq. Q. Here, 
denotes the effective activity of the international network 
as defined in Eq. (§. The evolution equations of the re¬ 
sulting system represent a generalization of the evolution 
equations for identical networks which we derived in [3], 
where one replaces the activity of the international net¬ 
work with the effective activity from Eq. (H). This pro¬ 
cedure yields: 


max[Wij/Ni] 


Ph = (k) Pi,int) [1 - Ph] - ^^+P■^,lPi,l 


denotes the fraction of the number of air travel passen¬ 
gers between countries i and j, Wij; and Ni, the popula¬ 
tion of country i. Notice that, in an ecological context, 
this corresponds to increased fitness of the international 
network. In Eq. we have implicitly assumed pro¬ 
portionality between the number of passengers and the 
number of contacts in the respective countries, namely 
Nij oc Wij. Einally, note that the arbitrary normaliza¬ 
tion in Eq. serves the sole purpose of ensuring that 
reasonable values for the parameter a are of the order of 
unity. 

Hereafter, we decompose the international network 
into a set of disjunct coupled subnetworks operating in 
each country and in competition with the respective local 
network (see Fig. [^). These subnetworks are neverthe¬ 
less not independent, as they are globally coupled via the 
effective activity defined in Eq. and ultimately by the 
network representing the inter-country social ties. Hence, 
our model forms a network of networks [SHU, where each 
node in Fig. represents a three-layer multiplex net¬ 
work [23112S] in which the bottom layer corresponds to 
the underlying social structure and the two upper layers 
denote the local and international networks operating in 
the respective country (see Fig. Bi)- 


E. Double meanfield approximation reveals 
complex role of the activity affinity 


Pi,l — ~Pi,lPi,l 


1 + (k) p 



(4) 


As shown in Hi, we further assume the the same linear 
relationship between virality and media influence in each 
country, that is: 


Pi,loc 


pi,int 


^^loc(pf,loc^Pf,int) 

u 

Aa;int(p^jocJ Pi,int) 

V 


(5) 


As shown in |3| , the value of v does not affect the stability 
of the system. In what follows, we perform the stability 
analysis in the limit v ^ oo. This decouples the evolution 
of p^i from p®;, so that we only have to consider 
Plugging in the weights function defined in Eq. Q and 
the effective activity from Eq. ([^ yields the evolution 
equations for the activities of the local and international 
networks in country i: 


Pi,\oc Pi,\oc 


Pi,int Pi,int 


tloc]" 


A (fc) [Ptoc. r. a 1 . 

Kloc]" + (Pf.int+<50'^^ 

^ (fc) (Pf.int + 


Kloc]'" + (Pf.int + ^i)' 


:[1 - PLnt] - 1 


( 6 ) 


To understand the qualitative behavior of the system, 
in this section we present a double meanfield approxi¬ 
mation of the system. This reduces the system given 


where (5* = a HiiPyinf 

The second meanfield approximation consists of apply¬ 
ing the hypothesis of a fully mixed homogeneous network 
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Figure 2. Bifurcation diagram and stream plots for the double meanfield approximation 0 for A (fc) = 3.5 and v —>■ oo. The 
basins of attraction for the domination of the international network are in blue, the basins of attraction for the domination of 
local networks are in red; the white areas correspond to the basins of attraction of the coexistence solution (if it exists). 
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for the inter-country social ties. We use = a {ilij) 
and define the mean activity of the local networks as 
X = the mean activity of the international 

network as y = (pfjnt)- Finally, our double meanfield 
approximation leads to the following system of coupled 
differential equations 


X{k) 


x'^ + {y{l + n))' 


:[l-x]-l 


y = y 


A {k) + [1 -y\-l 

x'^ + {y{l + ^)Y^ 


(7) 


Phase diagram 



a 


I I Coexistence 
is possible 

H Coexistence 
is impossible 

H Local attracts 
from {xo,yo) 
a Saddlenode 
bifurcation 
fc I Attractor 
switching 


which has three relevant parameters: A(fc), cr, and Cl. 
Note that by setting Y to zero, we recover the equations 
for identical networks presented in [3]. 

In what follows, we discuss the dynamical properties 
of the system given by Eq. Q. For constant cr, the sys¬ 
tem exhibits a saddle-node bifurcation at a critical value 
of the global connectivity f2c(o’) (see Fig. [^. Above this 
point, coexistence is not possible and the only stable so¬ 
lutions correspond to the domination of either local net¬ 
works or the international one. Both above and below 
the critical value Cider), the basin of attraction of the so¬ 
lution corresponding to the domination of local networks 
decreases with Cl, whereas that of the international net¬ 
work increases (see the rows of Fig. [^. Furthermore, at 
the critical point, the basin of attraction of the interna¬ 
tional network is amplified discontinuously as the region 
of coexistence in the subcritical regime is now merged 
with the basin of attraction of the domination of the in¬ 
ternational network. 

For constant O > 0, the system also exhibits a saddlen¬ 
ode bifurcation at a critical value of the activity affinity 
ac{Cl). In [3] we showed that the system undergoes a 
subcritical pitchfork bifurcation with respect to the con¬ 
trol parameter a, above which no stable coexistence is 
possible, > 0 breaks the symmetry of the pitchfork 
bifurcation and in this case the system undergoes a sad¬ 
dlenode bifurcation with respect to a instead (see bottom 
of Fig. [^. This behavior is well known in bifurcation 
theory and results from adding a small error term to the 
normal form of the pitchfork bifurcation (see Supplemen¬ 
tary Materials). The evolution of the basins of attraction 
is more complex compared to the previous case. Below 
the critical point, both basins of attraction increase with 
a. Above the critical point crc(^)) the basin of attrac¬ 
tion of the local network increases whereas the basin of 
attraction of the international network decreases with a 
(see the columns of Fig. [^. This is particularly interest¬ 
ing as it implies that an intermediate value of the activity 
affinity just slightly above the critical point cr > crdCl) 
represents the worst scenario for the survival of local net¬ 
works, since at this point the size of the basin of attrac¬ 
tion of the domination of the international network is 
maximum. 

In Fig.J^ the blue line indicates the critical line Cider) 
in the a-fl plane, which separates a phase in the parame¬ 
ter space where coexistence is possible (white region) and 


Figure 3. Phase diagram of the double meanfield approxi¬ 
mation for A (k) = 3.5. The white area denotes the parame¬ 
ters for which coexistence is possible. The blue area denotes 
the parameters for which only domination of either local or 
the international network can occur. At the blue line, the 
system undergoes a saddle-node bifurcation in which the sta¬ 
ble coexistence solution disappears. The red line denotes the 
combination of parameters for which the system switches at¬ 
tractors for the initial conditions given by Eq. Q, where we 
use /3 = 0.2 to reflect that the US contributes about 20% 
of the population. The red region shows the parameters for 
which the solution of domination of local networks is reached 
for these initial conditions. Above the red line, the system ap¬ 
proaches the domination of the international network (solid 
red line) or the coexistence solution (dashed red line). 


one in which only domination can occur (blue region). 
However, the increasing size of the basin of attraction of 
the domination of local networks above the critical point 
with respect to ct > ad^l) can dramatically alter the fate 
of the system for a given set of initial conditions. Assume, 
for instance, that the international network dominates in 
the US and starts with a significant delay in each other 
country, which causes the local networks to dominate in 
those countries. At the time when the international net¬ 
work is launched globally, the state of the system can be 
approximated as follows 


a;o = (I - 13) 
yo= /3 1 - 


I - 


X{k) 


X{k) 


( 8 ) 


which we now use as initial conditions to study the fur¬ 
ther evolution. Notice that if one network dominates in 
country i, its activity is given by = I — yjFy- Hence, 
the initial conditions given by Eq. ([^ reflect the fact that 
local networks dominate in the fraction (1 —/3) of the sys¬ 
tem and the international one dominates in the remain¬ 
der. The evolution of the basins of attraction makes the 
system approach different stationary solutions from these 
initial conditions for different parameters. Below the red 
line in Fig. the system approaches the domination of 
local networks starting from the initial conditions given 
in Eq. ([^. Above this line, the system either approaches 
coexistence (white area; crossing dashed red line) or dom¬ 
ination of the international network (blue area; crossing 
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solid red line). This means that in the red region, when 
the international network is launched globally, it is not 
able to overcome the initial advantage of the local net¬ 
works due to its earlier launch. 

To conclude, the double meanfield approximation pre¬ 
dicts that intermediate values of the activity affinity most 
favor the international network; whereas the local net¬ 
works can dominate for a high activity affinity and low 
global connectivity. We confirm these findings by numer¬ 
ical simulations in the following section. 


F. Numerical simulations and synthetic networks 


In this section, we go beyond the meanfield approxima¬ 
tion and study, by means of numerical simulations, the 
effects of the real topology of inter-country social ties and 
of underlying social structures. To this end, we use the 
air travel network (see Fig. and Materials and Meth¬ 
ods) as a proxy for inter-country social ties and construct 
1:1000 scaled synthetic networks to model the structure 
of the 80 countries with most Internet users (see Tab.|T|. 
To generate these networks, we make use of a model in¬ 
troduced in [26U28] . which produces realistic topologies 
of the traditional offline social networks, including het¬ 
erogeneous node degrees and a high level of clustering 
(see Materials and Methods). 

Fig.i shows results from our model for the set of pa¬ 
rameters that best matches empirical observations, as ex¬ 
plained in the following section. The international net¬ 
work starts with a delay in all countries except the US; 
so that initially in these countries the respective local 
network dominates. After some time, the international 
network obtains a significant advantage and quickly takes 
over in most countries. 

To further study the properties of the model presented 
here, we define the relative prevalence of the international 
network compared to local networks as: 


1 ^> 

$ = —y 

r7 _ 


Pi,int I St 


i=l Pz.int 


,loc 


(9) 


where Uf. denotes the number of countries, and p^intlst 


and 


W.loc 


are the activities of the international and lo- 

St 


cal networks in country i in the stationary state. With 
this definition, a value of $ ~ 0 implies that local net¬ 
works dominate in most countries, whereas $ « 1 corre¬ 
sponds to the domination of the international network. 
The relative prevalence of the international network av¬ 
eraged over many realizations is shown in Fig. [^for dif¬ 
ferent values of a as a function of the activity affinity, 
CT, and the launch time delay. At. For small values of 
At, we observe that when cr is small, the international 
and local networks coexist and we observe values around 
$ « 0.5 for the relative prevalence; then, increasing a fa¬ 
vors the international network, which dominates for val¬ 
ues of (T > 0.5 (see Fig. and b). For larger values of 


At, this behavior smoothly translates into a more com¬ 
plex case, which we discuss below. 

We observe in Fig. that for launch time delays 
At > 2, the actual length of the delay becomes irrelevant. 
This behavior corresponds to the limit of saturation of 
the evolution of local networks before the international 
OSN is launched; as discussed in the previous section. 
We consider this limit by averaging over regions with 
At > 2 in Fig.|^, which yields a two dimensional param¬ 
eter space a-a. Indeed, numerical simulations of the full 
model confirm the results from the meanfield analysis; 
in particular the complex role of the activity affinity a. 
For small a and u, local networks and the international 
OSN can coexist. Increasing cr or a favors the domina¬ 
tion of the international network, which gives rise to the 
blue “V”-shaped region around a — 0.5. This corrobo¬ 
rates the saddlenode bifurcation predicted by the double 
meanfield approximation. See supplementary video for 
an explicit realization. For high values of tr and small 
values of a (red region in the bottom right-hand cor¬ 
ner of Fig. 1^, local networks dominate. Note that par¬ 
tial states are also possible, in which the international 
network dominates in some countries and local networks 
dominate in the remaining countries. See supplementary 
video for an explicit realization of this case. 

Between the regions of domination of the international 
network and of local networks, there is a region in which 
the final fate of the system varies significantly between 
different realizations of the model (“coinflip region”). In 
this region, if the international network wins initially in 
the US, it will become dominant globally; otherwise, lo¬ 
cal networks maintain their initial prevalence. Although 
in this region the prevalence of the international network 
averaged over many realizations is about 0.5, as in the co¬ 
existence region in the bottom left-hand corner of Fig. 
the behavior of the system differs dramatically from one 
to another. In the coexistence region, each realization of 
the model leads to the same final state: coexistence of 
local networks and the international OSN. In contrast, 
in the coinflip region, coexistence is not possible, as this 
region of the parameter space corresponds to the super¬ 
critical regime (the blue area in Fig. [^. In the coinflip 
region, about 50% of the realizations end up with domi¬ 
nation of the international network, whereas the remain¬ 
ing 50% lead to the domination of local OSNs. As a 
consequence, even if we know the exact parameters, it is 
impossible to predict the fate of the system beforehand. 

We can summarize these findings as follows. A higher 
value of a, which is a measure of the global connectiv¬ 
ity of society, favors the prevalence of the international 
network and hinders the survival of the local ones. The 
role of the tendency of individuals to participate in more 
active networks (activity affinity), ct, is particularly inter¬ 
esting. Low values allow the networks to coexist, whereas 
intermediate values always lead to the prevalence of the 
international network and the extinction of local OSNs. 
A high activity affinity, however, enables the prevalence 
of local networks and thus can even lead to the extinction 






t=2006 


t=2008 



Figure 4. Evolution of our model averaged over several realizations for the parameters a = 1.25, At = 3, and a = 2 (here, we 
excluded realizations in which local networks dominate, which occurs with approximately 30% probability for these parameters). 
The mapping from model time to real time is explained in the following section. The relative importance of the influence of 
mass media compared to the viral spreading mechanism in governed by the parameter u introduced in Eq. ([^ . In all numerical 
simulations, we set v = 4: the value found empirically in [5]; and A = 0.2 (this corresponds to A/A); ~ 4.3 in [^). The relative 
prevalence of the international network, given by pf,int/(p/,int + p/ioc)i is color coded. We consider the international network to 
be banned in China and Iran. To model this, we set the values of = 0 for each entry which involves one of these countries. 
This is equivalent to assuming that in these countries two local networks compete without any coupling to the rest of the world. 


of the international network. 


G. Comparison with empirical data 

In this section, we compare the results of our model 
with empirical data on the recent expansion of Facebook 
at the cost of many local networks. In particular, we con¬ 
sider the evolution of the number of countries in which 
local networks (i.e. networks that are not Facebook) are 
the most popular ones, as measured in [29j using Alexa 
traffic data (see Fig. We observe a significant de¬ 
cline of this number, which rules out the possibility that 
the empiric case corresponds to the domination of local 
networks. Because the past can be considered a single re¬ 
alization of a stochastic process [30] , the empiric case can 
still be within the coinflip region of our model where -by 
chance- the international network was more successful. 
Hence, we will perform the following comparison only for 


realizations of our model in which local networks do not 
dominate. 

The intrinsic timescale of the model is arbitrary and 
hence has to be mapped to real time. The optimal map¬ 
ping is given such that it produces the best agreement 
with the empirical data. We quantify the agreement be¬ 
tween model results and empirical data using the sum of 
the squared distances between the data points and model 
results. In particular, we use the statistic defined as: 

= ’ ( 10 ) 
i 

where W denotes the number of countries where the lo¬ 
cal network is more popular and is the corre¬ 

sponding result from the model. The index i denotes the 
individual datapoints and cr^ is the estimated variance 
of the data (Supplementary Materials). The real time, 
tn, is a linear function of the model time Im given by 
tR = ai + a 2 tM, where oi is the starting year and 02 
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a=0.5 



0.0 


a=0.75 



0.0 


Figure 5. Relative prevalence of the international network is plotted on the 2 axis as a function of the launch time delay At 
and the coupling strength a. Averaged over 30 realizations. 



a a a 


Figure 6. The prevalence of the international network for a) At = 0, b) At = 0.1, and c) averaged over time delays At > 2 
as a function of the activity affinity (ct) and the global connectivity (a). Averaged over 30 realizations. 


represents the time stretch: how many years of real time 
correspond one model time step. For a given set of pa¬ 
rameters, cr, a, and At, the optimal values for oi and 02 
are those that minimize as shown in Fig. 

We can also use the statistic to estimate the pa¬ 
rameters a, cr, and At which best reproduce the empir¬ 
ical observations. In Fig. [^, we plot the values of y^ 
as a function of a, a, and At, where -at each point- we 
applied the respective best time mapping, as described 
above. These results are averaged over several realiza¬ 
tions of the model; however, in the coinflip region we 
exclude realizations where the local networks dominate, 
to mimic the empirical case. Interestingly, the overall 
best fit is achieved for a = 2 at cr = 1.25 and At = 3, 
which lies in the coinflip region (the optimal value of y^ 


is statistically consistent with the model, given the num¬ 
ber of degrees of freedom in the data) with a probability 
for domination of the international network of 70%. This 
scenario corresponds to the time mapping oi = 2006 and 
02 = 0.6, meaning the system started at the beginning 
of 2006; while the launch time delay of At = 3 in the 
model translates to 1.8 years in real time. In Fig. 
we show the evolution of the number of countries where 
local networks are more popular for the optimal fit from 
the model. 
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Figure 7. Comparison between model results and empirical data, a) for different values of ai (year) and 02 (time stretch) 
for a = 1.25, a = 2, and At = 3. b) Model results and empirical data. Here, the parameters are a = 1.25, a = 2, and At = 3 
and the optimal mapping is given by oi = 2006 and 02 = 0.6 (see a), c) x^ for the respective best time mapping as a function 
of a and At for different values of a. In each plot, the minimal value of x^ and the respective time mapping are shown in the 
boxes. The color coding in all plots represents the logarithm of ■ 


III. DISCUSSION 

Understanding the complex dynamics of the digital 
world constitutes an important challenge for interdisci¬ 
plinary science. To meet this challenge, here we describe 
the the worldwide web as a complex, digital ecosystem 
in which interacting networks play the role of species in 
competition for survival. In particular, we study the com¬ 
petition between local networks operating in single coun¬ 
tries and an international network that operates in all 
countries. Therefore, a proper description of this system 
must necessarily involve the network of worldwide social 
interactions between different countries. 

We show that the effect of inter-country social ties can 
be mapped to the increased fitness of the international 
network by means of an effective activity. Interestingly, 
there is a critical global coupling strength below which 
networks can coexist. However, above that threshold, 
only domination is possible: in general, local networks 
become extinct with a high probability. Yet, we find that 
if local networks are launched earlier they can persist and 
dominate the international network, which happens only 
if local networks have accumulated a sufficiently large ac¬ 
tive userbase when the global launch of the international 


network takes place. The accumulation of a sufficient 
base depends on the parameters; and for certain param¬ 
eters on chance. For these parameters the final state of 
the system -whether local networks dominate or become 
extinct- can be completely unpredictable, as it varies ran¬ 
domly between different realizations of the model. 

Quite remarkably, a thorough comparison of our model 
with empirical data from the recent takeover of Facebook 
indicates that the most probable launch date of Facebook 
was at the beginning of 2006 and its global launch was 
in late 2007. Facebook was in fact started in 2004, but 
opened to the public in 2006; in good agreement with 
the estimate from our model. Moreover, according to 
Google trend data (see Supplementary Materials), 2007 
was the year when the global search volume for Facebook 
started to increase rapidly. Last but not least, our best 
estimation of the model parameters corresponds to the 
“coinflip” region, which means that the observed takeover 
of Facebook only had a probability of around 70%. With 
a 30% probability, we would have been living in a world 
where each country had its own successful local network 
and a network like Facebook would not exist [50] . 

Our findings suggest interesting future lines of re¬ 
search. On the one hand, even without adjusting the 
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parameters on a country-by-country level, our model re¬ 
produces the main features empirically observed in the 
takeover of Facebook and the extinction of local networks 
in most countries for a certain parameter region. It re¬ 
mains an interesting task for future research to further 
increase the precision of the model. This could be done 
by improving the proxy for the similarity between coun¬ 
tries or by adjusting parameters on a country-by-country 
basis. On the other hand, the model could be extended to 
account for several international networks and to study 
their global competition. For a second international net¬ 
work to overcome the first, a certain minimal difference 
of fitness is needed; which could be the result of different 
properties of the networks, such as features or function¬ 
alities. Finally, random fluctuations of fitness could be 
incorporated to describe Darwinian selection in the digi¬ 
tal ecosystem. 


IV. METHODS AND MATERIAL 


Here, we use an exponential distribution p^{k) = 
with ^ = 10. We set the parameters a = 1.5 and fi = 
0.02. After generating the networks, we remove nodes 
with zero degree. Fig. shows the degree distribution 
and the clustering spectrum for the synthetic network 
created for the US. 

B. Air travel data 

Air travel data aggregated on a country basis were 
taken from http://visualising.org/datasets/global- 
flights-network (date of access July 2014). 
The original data can be accessed at 
http://openflights.org/data.html. The network on 
a country basis incorporates 230 nodes and 4600 
weighted edges which correspond to the number of 
routes between countries, i.e. the number of total flights. 
The dataset contains around 60.000 of such flights. We 
extract the subnetwork (see Fig. &) by constraining to 
the countries listed in Tab. HI 


A. SI model 


C. Supplementary Material 


We use the SI model [55H2H] to generate the synthetic 
networks for the underlying societies in each country. 
The model allows us to specify the degree distribution 
and the level of clustering. The model is based on a cir¬ 
cle as a hidden metric space and works as follows: 

1. All nodes are placed on the circle with a randomly 
assigned variable, 9, which represents the polar co¬ 
ordinate. 9 is uniformly distributed in [0, 27r). To 
keep the average node density on the circle con¬ 
stant, its radius grows linearly with the number of 
nodes, to satisfy N = 2-kR. 

2. We assign each node a second hidden variable, k, 
which represents its expected degree, k is drawn 
from an arbitrary distribution p(k). 

3. A pair of nodes is connected with a probability, r, 
that depends on their hidden variables (0, k) and 

(0',k'): 


r(0, k; 9', k') 



d{9,9') 

flKn' 


( 11 ) 


with Here, d{9,9') denotes the geodesic 

distance between the two nodes on the circle and 
{k) the mean degree. Then, the expected degree, 
^(k), of a node with hidden variable n can be shown 
to be proportional to k [28]. As a consequence, the 
degree distribution, p{k), of the network follows the 
shape of the distribution p{k). 


Supplementary Material is avail¬ 
able at: http://www.nature.com/article- 

assets / npg/srep/2016/160427/srep25116/extref/srep25116- 
s3.pdf 
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Figure 8. Degree distribution and clustering spectrum for 
networks generated for the example of the US (« 230.000 
nodes). 
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Ghina 

253. 

UnitedStates 

231. 

Japan 

90.9 

India 

81. 

Brazil 

64.9 

Germany 

62. 

U nitedKingdom 

48.8 

Russia 

45.2 

France 

42.9 

SouthKorea 

37.5 

Indonesia 

30. 

Spain 

25.2 

Canada 

25.1 

Italy 

25. 

Turkey 

24.5 

Mexico 

23.3 

Iran 

23. 

Vietnam 

20.8 

Poland 

18.7 

Pakistan 

18.5 

Golombia 

17.1 

Malaysia 

16.9 

Thailand 

16.1 

Australia 

15.2 

Taiwan 

15.1 

Netherlands 

14.3 

Egypt 

11.4 

Argentina 

11.2 

Nigeria 

11. 

Ukraine 

10.4 

Morocco 

10.3 

Sweden 

8.1 

SaudiArabia 

7.7 

Belgium 

7.3 

Venezuela 

7.2 

Peru 

7.1 

Romania 

6.1 

CzechRepublic 

6. 

Austria 

5.9 

Hungary 

5.9 

Switzerland 

5.7 

Philippines 

5.6 

Chile 

5.5 

Denmark 

4.6 

Portugal 

4.5 

Finland 

4.4 

Greece 

4.3 

Sudan 

4.2 

SouthAfrica 

4.2 

HongKong 

4.1 

Algeria 

4.1 

Norway 

3.9 

Slovakia 

3.6 

Syria 

3.6 

Singapore 

3.4 

Kenya 

3.4 

Belarus 

3.1 

NewZealand 

3. 

Serbia 

2.9 

U nited ArabEmirates 

2.9 

Ireland 

2.8 

Tunisia 

2.8 

Bulgaria 

2.6 

Uganda 

2.5 

Uzbekistan 

2.5 

Kazakhstan 

2.3 

Lebanon 

2.2 

DominicanRepublic 

2.1 

Israel 

2.1 

Guatemala 

2. 

Groatia 

1.9 

Lithuania 

1.8 

Jamaica 

1.5 

Jordan 

1.5 

Azerbaijan 

1.5 

GostaRica 

1.5 

Cuba 

1.4 

Zimbabwe 

1.4 

Uruguay 

1.3 

Ecuador 

1.3 


Table I. List of countries and estimated number of Internet users (xlO®) according to Wolfram Alpha database. 
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